keep update #25

Merged
junjun315 merged 144 commits into junjun315:develop from PaddlePaddle:develop
Jun 12, 2019
Conversation

@junjun315
Owner

No description provided.

sneaxiy and others added 30 commits May 27, 2019 21:13
* add Concat quantization
add unit test for quantizing concat
fix for wrong value when the input is not in map of calculated scales
add use_quantizer to concat_op.cc
add scale_algo rules for concat

test=develop

* missing fix for multiple inputs quantize-squash

* wojtuss review fix: adding comment

test=develop
* Enable sqrt operator for the nGraph Bridge.

test=develop

* Update activation_op.h
* Bias correction for exponential moving average

test=develop, test=document_preview

* Fix docs

test=develop, test=document_preview
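
The bias correction named in the commit above can be illustrated with a minimal sketch. It assumes the common Adam-style correction, where the running average is divided by `1 - decay ** step`; the commit message does not confirm the exact formula used in Paddle, and the function name `ema_with_bias_correction` is hypothetical.

```python
def ema_with_bias_correction(values, decay=0.9):
    """Return a list of (raw_ema, corrected_ema) pairs, one per input value.

    Assumption: the correction divides by (1 - decay ** step), as in Adam.
    """
    ema = 0.0  # zero initialisation is what biases early averages toward 0
    result = []
    for step, x in enumerate(values, start=1):
        ema = decay * ema + (1.0 - decay) * x
        corrected = ema / (1.0 - decay ** step)
        result.append((ema, corrected))
    return result


if __name__ == "__main__":
    for raw, corrected in ema_with_bias_correction([1.0, 1.0, 1.0]):
        print(raw, corrected)
```

For a constant input the raw average starts far below the input value, while the corrected average matches it from the first step.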
#17371)

* fix the api example for create_global_var, create_parameter, SGDOptimizer, RMSPropOptimizer, MomentumOptimizer, LarsMomentumOptimizer, FtrlOptimizer
test=develop

* add example for adamoptimizer
fix API.spec
test=develop

* test=develop

* test=develop
* [DOC] Fix api docs, test=develop, test=document_preview

* [DOC] Fix api annotation: fluid.layers.tensor_array_to_tensor. test=develop, test=document_preview

* test=develop, test=document_preview

update MD5 of tensor_array_to_tensor
* - changes to graph detector

- Changes to pass

- Added ut for new pass

- use_pass

- Added pass to mkldnn passes

- fix to registration

- improved verbose messaging for conv bias passes

- Lint fixes

test=develop

* - Lint fixes

test=develop
…17570)

* add INT8 conv+relu6 fuse and enable mobilenetv2 INT8 test
test=develop

* change false and 0.0 to fuse_brelu and brelu_threshold
test=develop

change the "fuse_relu||fuse_brelu" to "unsigned_output"
test=develop

* Use relu instead of brelu as INT8 post-op because INT8 brelu is not enabled in mkldnn v0.18
test=develop

* continuous-integration fix
test=develop
* update python APIs

test=document_preview test=develop

* update API.spec

test=document_preview

test=develop

* update merge_selected_rows

* update API.spec

test=document_preview

test=develop

* update API.spec

test=document_preview

test=develop

* fix the comment of less_than

test=develop

test=document_preview

* update API.spec

test=develop

test=document_preview

* update API.spec

test=develop

test=document_preview

* update API.spec

test=develop

test=document_preview

* update API.spec

test=develop

* update API

test=develop
…er of inputs/outputs is 2 (#17415)

* Optimize the concat and split kernel for the special case where the number of inputs/outputs is 2.
test=develop

* Refine codes.
test=develop

* Correct the condition.
test=develop

* Move the definition of tmp_data outside the if statement.

* Print the cudnn minor version.
test=develop

* Fix the case when in_num/o_num is 1 in concat/split op.
test=develop

* Remove const_cast.
test=develop
* add gradient clip in minimize; test=develop

* fix bug; test=develop

* fix format; test=develop

* move new grad clip to dygraph/grad_clip.py; test=develop

* fix lr decay and grad clip test; test=develop

* separate dygraph grad clip; test=develop

* fix grad clip test; develop

* fix api spec bug; test=develop

* add blank line, test=develop,test=document_preview

to fix format problem
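
A gradient clip applied before the optimizer update, as in the commit above, can be sketched as follows. This is an illustration under assumptions: it uses clipping by global norm, which the commit messages do not specify, and `clip_by_global_norm` is a hypothetical helper, not the API in dygraph/grad_clip.py.

```python
import math


def clip_by_global_norm(grads, clip_norm):
    """Scale all gradients so that their joint L2 norm is at most clip_norm.

    `grads` is a list of flat lists of floats, one list per parameter.
    """
    global_norm = math.sqrt(sum(g * g for grad in grads for g in grad))
    if global_norm <= clip_norm:
        # Already within the limit: return unchanged copies.
        return [list(grad) for grad in grads]
    scale = clip_norm / global_norm
    return [[g * scale for g in grad] for grad in grads]


if __name__ == "__main__":
    print(clip_by_global_norm([[3.0], [4.0]], clip_norm=1.0))
```

Scaling every gradient by the same factor keeps the update direction and only shortens the step.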
* fix doc in transpiler, test=develop
* add depthwise_conv2d test=develop

* use cpu for ngraph test=develop
* test=develop, add ocr in dygraph test

* test=develop, add cudnn determinist

* test=develop, remove useless code

* test=develop, fix cmake error
* fix:
1. inference occupies multiple cards
2. facebox model inference occupies too much
test=develop

* refine python api comments: shuffle, while, scale, sampled_softmax_with_cross_entropy, scatter,
round, sin, sqrt, shape, split, soft_relu, slice, selu, ifelse, switch.
test=develop

* fix conflict error.
test=develop
* Enable less_than ngraph operator

test=develop

* Added compare unit-tests test=develop

* Update: date && removed import test=develop
zhhsplendid and others added 29 commits June 10, 2019 15:01
This change doesn't affect the CI task because the CI docker config build
was done in another way.

test=develop
* Enable seq_pool op to accept len 0 input

test=develop

* Update sequence_pool's api

test=develop

* Add more unittest cases for seq_pool op

test=develop

* Remove legacy comments

test=develop

* Don't use template in op maker

test=develop
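
Pooling over a batch that contains zero-length sequences, as enabled in the commit above, can be sketched like this. It is a plain-Python illustration, not the operator's implementation; the function name and the `pad_value` parameter used for empty sequences are assumptions made for this example.

```python
def sequence_sum_pool(flat_values, lengths, pad_value=0.0):
    """Sum-pool consecutive slices of flat_values, one slice per length.

    Assumption: an empty sequence (length 0) produces pad_value.
    """
    pooled = []
    offset = 0
    for length in lengths:
        if length == 0:
            pooled.append(pad_value)
        else:
            pooled.append(sum(flat_values[offset:offset + length]))
        offset += length
    return pooled


if __name__ == "__main__":
    print(sequence_sum_pool([1.0, 2.0, 3.0], [2, 0, 1]))
```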
* remove attribute in Allocator::Allocate, test=develop

* fix travis ci error, test=develop
1. Enhance auto pruning.
2. Fix save/load graph when using py_reader.
test=develop
TODO: this is a temporary fix for Paddle release 1.5. We have to fix
this failing unit test soon.

test=develop
fix doc: conv3d, layer norm, bilineartensorproduct, conv2dtranspose, conv3dtranspose,rowconv, groupnorm, spectralnorm, treeconv

hidden api: sequenceConv
* fix: RAM usage is high when the model is loaded from memory

test=develop

* light mem reuse
test=develop

* fix cpplint
test=develop
fix deconv2d bug for cycle_gan
Add Pipeline Concurrency Train Mode:
- Cpp: pipeline_trainer & section_worker
- Python: PipelineOptimizer
- Add a new data_feed type: PrivateInstantDataFeed
- Add a test demo of pipeline trainer and the test model is gnn
- Do not support win32 now
* update anakin-engine interfaces for content-dnn

test=develop

* support only-gpu mode of Anakin

modify eltwise parse

test=develop

* modification for thread-safe

test=develop

* Integrated template instance

test=develop

* increase template parameters

test=develop

* support MLU predictor

test=develop

* update anakin cmake files

test=develop

* update TargetWrapper::set_device

* update the initialization of anakin subgraph

test=develop

* use the default constructor of base class

test=develop
* - removed is_reusing_

* - Added TID to keys for reusing apart from softmax PD

* - compilation fix

* - Yet another compilation fix

* - Batch Norm and Conv adapted

* - Fix to softmax MT

* - Fixes to MT code of MKL-DNN

* - Lint fixes

test=develop
* add 'UserDefinedRoleMakerNCCL' for collective mode.

* code style

* add the name UserDefinedRoleMakerNCCL to __all__

* rename to UserDefinedRoleMakerCollective

* rename to UserDefinedCollectiveRoleMaker
* add unfold op
test=develop

* fix divide bug in python3 when calculating output width and height
test=develop

* add name=None in python api, move redundant code into inline function

* try to trigger ci for this code
test=develop
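
The Python 3 division issue mentioned in the unfold commit above comes from `/` returning a float where Python 2 returned an integer. A minimal sketch of an output-size calculation that uses floor division follows; the formula is the usual sliding-window one, and the function and parameter names are hypothetical rather than taken from the Paddle source.

```python
def unfold_output_size(in_size, kernel, stride=1, pad_before=0, pad_after=0,
                       dilation=1):
    """Number of sliding-window positions along one spatial dimension."""
    effective_kernel = dilation * (kernel - 1) + 1
    # `//` keeps the result an int in both Python 2 and Python 3;
    # `/` would produce a float in Python 3.
    return (in_size + pad_before + pad_after - effective_kernel) // stride + 1


if __name__ == "__main__":
    print(unfold_output_size(8, 3, stride=2))
```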
* add deformable psroi pooling

* test=develop

* test=develop

* test=develop
modify format

* fix bug

* test=develop run ci

* test=develop
add API.spec

* add test_layers.py

* run ci again

* test=develop
run ci again

* run ci again

* test=develop
run ci again

* test=develop
run ci again

* test=develop
run ci again

* add space between two lines

* test=develop
add space between two lines

* test=develop
add space between lines

* test=develop
modify comment in nn.py

* test=develop
add space between two lines

* test=develop
add space between two lines

* update API.spec

* run ci again

* test=develop
run ci again

* rerun ci

* test=develop
rerun ci

* change input shape

* run ci

* test=develop
run ci

* modify format of nn.py

* test=develop

* test=develop

* test=develop
update API.spec

* test=develop
fix API doc

* modify API comment

* modify API comment

* test=develop
update API.spec

* test=develop
modify comment

* test=develop
modify comment

* test=develop
modify comment

* test=develop
update API.spec

* test=develop
modify comment

* test=develop
add inference in nn.py

* test=develop
update API.spec

* test=develop
resolve conflict

* test=develop
update API.spec
* add shape not match doc to data layer, test=develop

* fix API.spec md5
test=develop
* fix logging unable. test=develop

* unset sys.stdout for stream handler. test=develop

* fix newly added basicConfig. test=develop

* fix import error. test=develop
* fix im2sequence padding bug, test=develop

* fix edit_distance, test=develop

* add API.spec,test=develop
Fix a bug in the sequence_unpad op: the allocated output memory did not match the actual memory, so the memory check failed. Fix it by allocating the output memory at the correct position in the code.
* Cherry-pick fix random Python3 CI failure.

In some tests, SWEs used "print('xxx').format('xxx')". This works
only in Python 2, where print is a statement; in Python 3, print()
returns None, so the .format call fails. However, since those lines
are related to data download, CI passes if the CI machines already
have the data. That causes random failures.

* Cherry-pick: disable CUDNN case of test_warpctc_op

Also temporarily disable a unit test. The test will be fixed under high priority.
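
The print/format failure described in the cherry-pick above can be reproduced with a short sketch. The function names and message text are made up for illustration; only the `print(...).format(...)` pattern comes from the commit message.

```python
import contextlib
import io


def broken_log(name):
    # In Python 3, print() returns None, so `.format` is looked up on None
    # and raises AttributeError after the unformatted text is printed.
    return print("downloading {}").format(name)


def fixed_log(name):
    # Format the string first, then print it.
    print("downloading {}".format(name))


def raises_attribute_error(fn, *args):
    """Return True if calling fn(*args) raises AttributeError."""
    try:
        with contextlib.redirect_stdout(io.StringIO()):
            fn(*args)
    except AttributeError:
        return True
    return False


if __name__ == "__main__":
    print(raises_attribute_error(broken_log, "mnist"))
    print(raises_attribute_error(fixed_log, "mnist"))
```

Because the faulty call sits on the download path, it only runs on machines that do not yet have the data, which matches the intermittent failures described above.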
Improve the error prompt: when loading parameters fails, users are prompted to check whether the model or parameter files are damaged.
@junjun315 junjun315 merged commit 15ac1c6 into junjun315:develop Jun 12, 2019